RAW SEQUENCE LISTING 
ERROR REPORT 



-> - , ■ 

BIOTECHNOLOGY 3JJ 

SYSTEMS 
/-J^F^RANCH 




The Biotechnology Systems Branch of the Scientific and Technical Information Center 
(STIC) detected errors when processing the following CRF diskette: 

Application Serial Number: 

Art Unit /Team No. : Q/P£ 

Date Processed by STIC: 1 1/o jOP^P 



THE ATTACHED PRINTOUT EXPLAINS THE ERRORS DETECTED. 

PLEASE BE SURE TO FORWARD THIS INFORMATION TO THE APPLICANTS 
BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT 
COMMUNICATION TO THE APPLICANTS ALONG WITH A NOTICE TO 
COMPLY or, 

2) CALLING APPLICANTS AND FAXING THEM A COPY OF THE PRINTOUT 
WITH A NOTICE TO COMPLY 

THIS WILL INSURE THAT THE NEXT SUBMISSION RECEIVED FROM THEM 
WILL BE ERROR FREE. 

IF YOU HAVE ANY FURTHER QUESTIONS, PLEASE CALL: 
MARK SPENCER 703-308-4212 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER: 



I.BER: . 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA- HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

Wrapped Nucleics the number/text at the end of each line "wrapped" down to the next line. 

This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "wrapping". 



Wrapped Aminos The amino acid number/text at the end of each line "wrapped " down to the next line. 

This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "wrapping". 



Incorrect Line Length The rules require that a line not exceed 72 characters in length. This includes spaces. 



Misaligned Amino Acid 
Numbering 



The numbering under each 5th amino acid is misaligned. This may be caused by the use of tabs 
between the numbering. It is recommended to delete any tabs and use spacing between the numbers. 



Non-ASCII 



.1 



Variable Length 



This file was not saved in ASCII (DOS) text, as required by the Sequence Rules. 

Please ensure your subsequent submission is saved in ASCII text so that it can be processed. 

Sequencefsj 1 contain rvs or Xaa's which represented more than one residue. 
As per the rules, each n or Xaa can only represent a single residue. 
Please present the maximum number of each residue having variable length and 
indicate in the (ix) feature section that some may be missing. 



Patentln ver. 2.0 "bug" 



A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequence(s) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section 
to the subsequent amino acid sequence. 



Skipped Sequences 
(OLD RULES) 



Sequence(s) missing. If intentional, please use the following format for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: 

(I) SEQUENCE CHARACTERISTICS:(Do not insert any headings under "SEQUENCE CHARACTERISTICS") 
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: 
This sequence is Intentionally skipped 



Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequence(s). 



Skipped Sequences 
(NEW RULES) 



Sequence(s) missing. If intentional, please use the following format for each skipped sequence. 

<210> sequence Id number 
<400> sequence id number 
000 



10 Use of n's or Xaa's 

(NEW RULES) 



Use of n's and/or Xaa's have been detected in the Sequence Listing. 
Use of <220> to <223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 



11 Useof <213>Organism Sequence(s), 

(NEW RULES) 



. are missing this mandatory field or its response. 



12 



13 



Useof <220>Feature 
(NEW RULES) 




Sequence(s) are missing the <220>Feature and asso ^at^ h^ 3 ^'"^ — " ~ 

^er*22"0> to <223> is MANDATORY if <213>ORGANISM is "Artificial" or "Unknown- 
Please explain source of genetic material in <220> to <223> section. 

(See "Fe deral Register," 6/01/98, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of new Rules) 



Patentln ver. 2.0 "bug" Please do not use "Copy to Disk** function of Patentln version 2.0. This causes a corrupted 

file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing). 
"V^ Instead, please use "File Manager" or any other means to copy file to floppy disk. 
~* AKS-Biotechnology Systems Branch- 5/15/99 
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RAW SEQUENCE LISTING 



DATE: 03/10/2000 
TIME: 09:51:22 



PATENT APPLICATION US/09/510 , 3.14V 

/■■' * 

Input Set: 1510314. RAW 



This Raw Listing contains the General Information 
Section and up to first 5 pages. 




1 <110> APPLICANT: Zhou, Ming-Ming 

2 Aggarnal, Aneel K 

3 <120> TITLE OF INVENTION: METHODS OF IDENTIFYING MODULATORS OF BROMODOMAINS 

4 <130> FILE REFERENCE: 2459-1-003 

5 <140> CURRENT APPLICATION NUMBER: US/09/510 , 3 14 

6 <141> CURRENT FILING DATE: 2000-02-22 

7 <160> NUMBER OF SEQ ID NOS : 44 HOGS Not Comply 

8 <i7o> software: Patentm ver. 2.0 mrrpcted Diskette Needed 

9 <210> SEQ ID NO 1 COrreCT^U Uiorx 

10 <211> LENGTH: 3014 

11 <212> TYPE: DNA 

12 <213> ORGANISM: Homo sapiens 

13 <400> SEQUENCE : 1 

14 ggggccgcgt cgacgcggaa aagaggccgt ggggggcctc ccagcgctgg cagacaccgt 60 

15 gaggctggca gccgccggca cgcacaccta gtccgcagtc ccgaggaaca tgtccgcagc 120 

16 cagggcgcgg agcagagtcc cgggcaggag aaccaaggga gggcgtgtgc tgtggcggcg 180 

17 gcggcagcgg cagcggagcc gctagtcccc tccctcctgg gggagcagct gccgccgctg 240 

18 ccgccgccgc caccaccatc agcgcgcggg gcccggccag agcgagccgg gcgagcggcg 3 00 

19 cgctaggggg agggcggggg cggggagggg ggtgggcgaa gggggcggga gggcgtgggg 360 

20 ggagggtctc gctctcccga ctaccagagc ccgagggaga ccctggcggc ggcggcggcg 420 

21 cctgacactc ggcgcctcct gccgtgctcc ggggcggcat gtccgaggct ggcggggccg 480 

22 ggccgggcgg ctgcggggca ggagccgggg caggggccgg gcccggggcg ctgcccccgc 540 

23 agcctgcggc gcttccgccc gcgcccccgc agggctcccc ctgcgccgct gccgccgggg 600 

24 gctcgggcgc ctgcggtccg gcgacggcag tggctgcagc gggcacggcc gaaggaccgg 660 

25 gaggcggtgg ctcggcccga atcgccgtga agaaagcgca actacgctcc gctccgcggg 720 

26 ccaagaaact ggagaaactc ggagtgtact ccgcctgcaa ggccgaggag tcttgtaaat 780 

27 gtaatggctg gaaaaaccct aacccctcac ccactccccc cagagccgac ctgcagcaaa 840 

28 taattgtcag tctaacagaa tcctgtcgga gttgtagcca tgccctagct gctcatgttt 900 

29 cccacctgga gaatgtgtca gaggaagaaa tgaaeagact cctgggaata gtattggatg 960 

30 tggaatatct ctttacctgt gtccacaagg aagaagatgc agataccaaa caagtttatt 1020 

31 tctatctatt taagctcttg agaaagtcta ttttacaaag aggaaaacct gtggttgaag 1080 

32 gctctttgga aaagaaaccc ccatttgaaa aacctagcat tgaacagggt gtgaataact 1140 

33 ttgtgcagta caaatttagt cacctgccag caaaagaaag gcaaacaata gttgagttgg 1200 

34 caaaaatgtt cctaaaccgc atcaactatt ggcatctgga ggcaccatct caacgaagac 1260 

35 tgcgatctcc caatgatgat atttctggat acaaagagaa ctacacaagg tggctgtgtt 1320 

36 actgcaacgt gccacagttc tgcgacagtc tacctcggta cgaaaccaca caggtgtttg 1380 

37 ggagaacatt gcttcgctcg gtcttcactg ttatgaggcg acaactcctg gaacaagcaa 1440 

38 gacaggaaaa agataaactg cctcttgaaa aacgaactct aatcctcact catttcccaa 1500 

39 aatttctgtc catgctagaa gaagaagtat atagtcaaaa ctctcccatc tgggatcagg 1560 

40 attttctctc agcctcttcc agaaccagcc agctaggcat ccaaacagtt atcaatccac 1620 

41 ctcctgtggc tgggacaatt tcatacaatt caacctcatc ttcccttgag cagccaaacg 1680 

42 cagggagcag cagtcctgcc tgcaaagcct cttctggact tgaggcaaac ccaggagaaa 1740 

43 agaggaaaat gactgattct catgttctgg aggaggccaa gaaaccccga gttatggggg 1800 

44 atattccgat ggaattaatc aacgaggtta tgtctaccat cacggaccct gcagcaatgc 1860 



V 
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■ /m - 

PAGE: 2 RAW SEQUENCE LISTING ZT DATE : 03/10/2000 

PATENT APPLICATION US/09/510 , TIME: 09:51:22 

Input Set: 1510314. RAW 

45 ttggaccaga gaccaatttt ctgtcagcac actcggccag ggatgaggcg gcaaggttgg 1920 

46 aagagcgcag gggtgtaatt gaatttcacg tggttggcaa ttccctcaac cagaaaccaa 1980 

47 acaagaagat cctgatgtgg ctggttggcc tacagaacgt tttctcccac cagctgcccc 2040 

48 gaatgccaaa agaatacatc acacggctcg tctttgaccc gaaacacaaa acccttgctt 2100 

49 taattaaaga tggccgtgtt attggtggta tctgtttccg tatgttccca tctcaaggat 2160 

50 tcacagagat tgtcttctgt gctgtaacct caaatgagca agtcaagggc tatggaacac 2220 

51 acctgatgaa tcatttgaaa gaatatcaca taaagcatga catcctgaac ttcctcacat 2280 

52 atgcagatga atatgcaatt ggatacttta agaaacaggg tttctccaaa gaaattaaaa 2340 

53 tacctaaaac caaatatgtt ggctatatca aggattatga aggagccact ttaatgggat 2400 

54 gtgagctaaa tccacggatc ccgtacacag aattttctgt catcattaaa aagcagaagg 2460 

55 agataattaa aaaactgatt gaaagaaaac aggcacaaat tcgaaaagtt taccctggac 2520 

56 tttcatgttt taaagatgga gttcgacaga ttcctataga aagcattcct ggaattagag 2580 

57 agacaggctg gaaaccgagt ggaaaagaga aaagtaaaga gcccagagac cctgaccagc 2640 

58 tttacagcac gctcaagagc atcctccagc aggtgaagag ccatcaaagc gcttggccct 2700 

59 tcatggaacc tgtgaagaga acagaagctc caggatatta tgaagttata aggttcccca 2760 

60 tggatctgaa aaccatgagt gaacgcctca agaataggta ctacgtgtct aagaaattat 2820 

61 tcatggcaga cttacagcga gtctttacca attgcaaaga gtacaacgcc gctgagagtg 2880 

62 aatactacaa atgtgccaat atcctggaga aattcttctt cagtaaaatt aaggaagctg 2 940 

63 gattaattga caagtgattt tttttccccc tctgcttctt agaaactcac caagcagtgt 3 000 

64 gcctaaagca aggt 3 014 

65 <210> SEQ ID NO 2 

66 <211> LENGTH: 832 

67 <212> TYPE: PRT 

68 <213> ORGANISM: Homo sapiens 

69 <400> SEQUENCE : 2 

70 Met Ser Glu Ala Gly Gly Ala Gly Pro Gly Gly Cys Gly Ala Gly Ala 

71 1 5 10 15 

72 Gly Ala Gly Ala Gly Pro Gly Ala Leu Pro Pro Gin Pro Ala Ala Leu 

73 20 25 30 

74 Pro Pro Ala Pro Pro Gin Gly Ser Pro Cys Ala Ala Ala Ala Gly Gly 

75 35 40 45 

76 Ser Gly Ala Cys Gly Pro Ala Thr Ala Val Ala Ala Ala Gly Thr Ala 

77 50 55 60 

78 Glu Gly Pro Gly Gly Gly Gly Ser Ala Arg lie Ala Val Lys Lys Ala 

79 65 70 75 80 

80 Gin Leu Arg Ser Ala Pro Arg Ala Lys Lys Leu Glu Lys Leu Gly Val 

81 85 90 95 

82 Tyr Ser Ala Cys Lys Ala Glu Glu Ser Cys Lys Cys Asn Gly Trp Lys 

83 100 105 110 

84 Asn Pro Asn Pro Ser Pro Thr Pro Pro Arg Ala Asp Leu Gin Gin lie 

85 115 120 125 

86 lie Val Ser Leu Thr Glu Ser Cys Arg Ser Cys Ser His Ala Leu Ala 

87 130 135 140 

88 Ala His Val Ser His Leu Glu Asn Val Ser Glu Glu Glu Met Asn Arg 

89 145 150 155 160 

90 Leu Leu Gly lie Val Leu Asp Val Glu Tyr Leu Phe Thr Cys Val His 

91 165 170 175 

92 Lys Glu Glu Asp Ala Asp Thr Lys Gin Val Tyr Phe Tyr Leu Phe Lys 

93 180 185 190 

94 Leu Leu Arg Lys Ser lie Leu Gin Arg Gly Lys Pro Val Val Glu Gly 
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PAGE: 3 RAW SEQUENCE LISTING DATE: 03/10/2000 

PATENT APPLICATION US/09/510 , 314"/ TIME: 09:51:22 

Input Set: 1510314. RAW 



95 






195 










200 










205 








96 


Ser 


Leu 


Glu 


Lys 


Lys 


Pro 


Pro 


Phe 


Glu 


Lys 


Pro 


Ser 


He 


Glu 


Gin 


Gly 


97 




210 










215 










220 










98 


Val 


Asn 


Asn 


Phe 


Val 


Gin 


Tyr 


Lys 


Phe 


Ser 


His 


Leu 


Pro 


Ala 


Lys 


Glu 


99 


225 










230 










235 










240 


100 


Arg 


Gin 


Thr 


He 


Val 


Glu 


Leu 


Ala 


Lys 


Met 


Phe 


Leu 


Asn 


Arg 


He 


Asn 


101 










245 










250 










255 




102 


Tyr 


Trp 


His 


Leu 


Glu 


Ala 


Pro 


Ser 


Gin 


Arg 


Arg 


Leu 


Arg 


Ser 


Pro 


Asn 


103 








260 










265 










270 






104 


Asp 


Asp 


He 


Ser 


Gly Tyr 


Lys 


Glu 


Asn 


Tyr 


Thr 


Arg 


Trp 


Leu 


Cys 


Tyr 


105 






275 










280 










285 








106 


Cys 


Asn 


Val 


Pro 


Gin 


Phe 


Cys 


Asp 


Ser 


Leu 


Pro 


Arg 


Tyr 


Glu 


Thr 


Thr 


107 




290 










295 










300 










108 


Gin 


Val 


Phe 


Gly 


Arg 


Thr 


Leu 


Leu 


Arg 


Ser 


Val 


Phe 


Thr 


Val 


Met 


Arg 


109 


305 










310 










315 










320 


110 


Arg 


Gin 


Leu 


Leu 


Glu 


Gin 


Ala 


Arg 


Gin 


Glu 


Lys 


Asp 


Lys 


Leu 


Pro 


Leu 


111 










325 










330 










335 




112 


Glu 


Lys 


Arg 


Thr 


Leu 


He 


Leu 


Thr 


His 


Phe 


Pro 


Lys 


Phe 


Leu 


Ser 


Met 


113 








340 










345 










350 






114 


Leu 


Glu 


Glu 


Glu 


Val 


Tyr 


Ser 


Gin 


Asn 


Ser 


Pro 


lie 


Trp 


Asp 


Gin 


Asp 


115 






355 










360 










365 








116 


Phe 


Leu 


Ser 


Ala 


Ser 


Ser 


Arg 


Thr 


Ser 


Gin 


Leu 


Gly lie 


Gin 


Thr 


Val 


117 




370 










375 










380 










118 


He 


Asn 


Pro 


Pro 


Pro 


Val 


Ala 


Gly 


Thr 


He 


Ser 


Tyr Asn 


Ser 


Thr 


Ser 


119 


385 










390 










395 










400 


120 


Ser 


Ser 


Leu 


Glu 


Gin 


Pro 


Asn 


Ala 


Gly 


Ser 


Ser 


Ser 


Pro 


Ala 


Cys 


Lys 


121 










405 










410 










415 




122 


Ala 


Ser 


Ser 


Gly 


Leu 


Glu 


Ala 


Asn 


Pro 


Gly 


Glu 


Lys 


Arg 


Lys 


Met 


Thr 


123 








420 










425 










430 






124 


Asp 


Ser 


His 


Val 


Leu 


Glu 


Glu 


Ala 


Lys 


Lys 


Pro 


Arg 


Val 


Met 


Gly 


Asp 


125 






435 










440 










445 








126 


He 


Pro 


Met 


Glu 


Leu 


He 


Asn 


Glu 


Val 


Met 


Ser 


Thr 


lie 


Thr Asp 


Pro 


127 




450 










455 










460 










128 


Ala 


Ala 


Met 


Leu 


Gly Pro 


Glu 


Thr 


Asn 


Phe 


Leu 


Ser 


Ala 


His 


Ser 


Ala 


129 


465 










470 










475 










480 


130 


Arg 


Asp 


Glu 


Ala 


Ala 


Arg 


Leu 


Glu 


Glu 


Arg 


Arg 


Gly Val 


lie 


Glu 


Phe 


131 










485 










490 










495 




132 


His 


Val 


Val 


Gly 


Asn 


Ser 


Leu 


Asn 


Gin 


Lys 


Pro 


Asn 


Lys 


Lys 


He 


Leu 


133 








500 










505 










510 






134 


Met 


Trp 


Leu 


Val 


Gly 


Leu 


Gin 


Asn 


Val 


Phe 


Ser 


His 


Gin 


Leu 


Pro 


Arg 


135 






515 










520 










525 








136 


Met 


Pro 


Lys 


Glu 


Tyr 


He 


Thr 


Arg 


Leu 


Val 


Phe 


Asp 


Pro 


Lys 


His 


Lys 


137 




530 










535 










540 










138 


Thr 


Leu 


Ala 


Leu 


He 


Lys 


Asp 


Gly 


Arg 


Val 


lie 


Gly Gly 


lie 


Cys 


Phe 


139 


545 










550 










555 










560 


140 


Arg 


Met 


Phe 


Pro 


Ser 


Gin 


Gly 


Phe 


Thr 


Glu 


He 


Val 


Phe 


Cys 


Ala 


Val 


141 










565 










570 










575 




142 


Thr 


Ser 


Asn 


Glu 


Gin 


Val 


Lys 


Gly 


Tyr 


Gly 


Thr 


His 


Leu 


Met 


Asn 


His 


143 








580 










585 










590 






144 


Leu 


Lys 


Glu 


Tyr 


His 


He 


Lys 


His 


Asp 


He 


Leu 


Asn 


Phe 


Leu 


Thr 


Tyr 
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PATENT APPLICATION US/09/510 , Z1& TIME: 09:51:22 

Input Set: 1510314. RAW 



145 






595 










600 










605 








146 


Ala 


Asp 


Glu 


Tyr 


Ala 


He 


Gly 


Tyr 


Phe 


Lys 


Lys 


Gin Gly 


Phe 


Ser 


Lys 


147 




610 










615 










620 










148 


Glu 


lie 


Lys 


He 


Pro 


Lys 


Thr 


Lys 


Tyr Val 


Gly Tyr 


He 


Lys 


Asp 


Tyr 


149 


625 










630 










635 










640 


150 


Glu 


Gly 


Ala 


Thr 


Leu 


Met 


Gly 


Cys 


Glu 


Leu 


Asn 


Pro 


Arg 


He 


Pro 


Tyr 


151 










645 










650 










655 




152 


Thr 


Glu 


Phe 


Ser 


Val 


He 


He 


Lys 


Lys 


Gin 


Lys 


Glu 


He 


He 


Lys 


Lys 


153 








660 










665 










670 






154 


Leu 


He 


Glu 


Arg 


Lys 


Gin 


Ala 


Gin 


He 


Arg 


Lys 


Val 


Tyr 


Pro 


Gly 


Leu 


155 






675 










680 










685 








156 


Ser 


Cys 


Phe 


Lys 


Asp Gly 


val 


Arg 


Gin 


He 


Pro 


He 


Glu 


Ser 


He 


Pro 


157 




690 










695 










700 










158 


Gly 


He 


Arg 


Glu 


Thr 


Gly 


Trp 


Lys 


Pro 


Ser 


Gly Lys 


Glu 


Lys 


Ser 


Lys 


159 


705 










710 










715 










720 


160 


Glu 


Pro 


Arg 


Asp 


Pro 


Asp 


Gin 


Leu 


Tyr 


Ser 


Thr 


Leu 


Lys 


Ser 


He 


Leu 


161 










725 










730 










735 




162 


Gin 


Gin 


Val 


Lys 


Ser 


His 


Gin 


Ser 


Ala 


Trp 


Pro 


Phe 


Met 


Glu 


Pro 


Val 


163 








740 










745 










750 






1 

xo *% 


Ltja 


Arg 


Thr 




Ala 


Pro 




Tvr 

iyr 


Tyr 


Glu 


Val 


He 


Arg 


Phe 


Pro 




165 






755 










760 










765 








166 


Asp 


Leu 


Lys 


Thr 


Met 


Ser 


Glu 


Arg 


Leu 


Lys 


Asn 


Arg 


Tyr 


Tyr 


Val 


Ser 


167 




770 










775 










780 










168 


Lys 


Lys 


Leu 


Phe 


Met 


Ala 


Asp 


Leu 


Gin Arg 


Val 


Phe 


Thr 


Asn 


Cys 


Lys 


169 


785 










790 










795 










800 


170 


Glu 


Tyr 


Asn 


Ala 


Ala 


Glu 


Ser 


Glu 


Tyr 


Tyr 


Lys 


Cys 


Ala 


Asn 


He 


Leu 


171 










805 










810 










815 




172 


Glu 


Lys 


Phe 


Phe 


Phe 


Ser 


Lys 


He 


Lys 


Glu 


Ala Gly Leu 


He 


Asp 


Lys 


173 








820 










825 










830 






174 


<210> SEQ 


ID NO 3 





















175 <211> LENGTH: 12 



176 <212> TYPE: PRT \ , ^Ug^/Z^^ £/W^ / Q T^J u 

177 <213> ORGANISM: Artificial Sequence ^-y^ ,/ Jh/ 71 ci °J lyr iuXd 

178 <220> FEATURE: 40*W{ <* Uy ' / ~ £ v 0 i/r***^ 



179 <223> OTHER INFORMATION: Description of Artificial^ Sequence: ( geptid e, 

18? <22?> N^Sy: VARIANT .^iu ,(jL^£^ 

182 <222> LOCATION: (2)^ — - — ~~ ~ ' ~— ^J****** 4 ^ 

183 <223^0THER INFORMStionT It represents 2 to 3 undesignated amino acids. 



_ 3 

184 ( They can be any amino acids. 



185 <220> FEATURE" 
186 



<221> NAME/KEY: VARIANT ' -~.^/0i^^ 

187 <222> LOCATION: (4) 

188 <223> OTHER INFORMATION: It represents 5 to 8 undesignated amino acids. 

189 / They can be any amino acids. 

190 <220>^FEATURE-: " 



191 <221> NAME/KEY: VARIANT 

192 <222> LOCATION: (6) 

193 <223> OTHER INFORMATION: It represents one undesignated amino acid. It can 

194 be any amino acid. 



PAGE: 



195 


<220> 


196 


<221> 


197 


<222> 


198 


<223> 


199 


c 


200 


<220> 


201 


<221> 


202 


<222> 


203 


<223> 


204 




205 


<220> 


206 


<221> 


207 


<222> 


208 


<223> 


209 




210 


<220> 


211 


<221> 


212 


<222> 


213 


<223> 


214 




215 


<400> 


W--> 216 




217 




218 


<210> 


219 


<211> 


220 


<212> 


221 


<213> 


222 


<220> 


223 


<223> 


224 


<220> 


225 


<221> 


226 


<222> 


227 


<223> 


228 

W--jpjU 229 


<400> 


° 230 




231 


<210> 


232 


<211> 


233 


<212> 


A j ft 




235 


<220> 


236 


<223> 


237 


<220> 


238 


<221> 


239 


<222> 


240 


<223> 


W--4)V~ 242 


<400> 


243 






<210> 


Please Note} 



O^PE 



RAW SEQUENCE LISTING j» 
PATENT APPLICATION US/09/510 , 3T4/ 



DATE: 03/10/2000 
TIME: 09:51:22 



FEATURE: 

NAME/KEY: VARIANT 

LOCATION: (9) 
-OTHER INFORMATION: 

be any amino acids < 
""FEATURE : 



(AM' 



Input Set: 1510314. RAW 



It represents 5 undesignated amino acids. They can 



LOCATION: (5) 
OTHER INFORMATION: 
or H. 
FEATURE: 

NAME /KEY: VARIANT 
LOCATION: (8) 
OTHER INFORMATION: 
or H. 
FEATURE: 



It can be any amino acid from the group of: P, K, 



It can be any amino acid from the group of: Y, F, 




It can be any amino acid from the group of: M, I, 



CaaJXaa Xaa Tyr Xaa (Xaa 



Sequence 



Description of Artificial Sequence :( peptide 



It is acetyl -lysine. 



LOCATION: (11) 
OTHER INFORMATION 
or V. 

SEQUENCE: 3 
Phe/Xaa Pro 

SEQ ID NO 4 
LENGTH : 12 
TYPE : PRT 

ORGANISM: Artificial 
FEATURE: 

OTHER INFORMATION 
FEATURE: 
NAME/KEY: SITE 
LOCATION: (6) 
OTHER INFORMATION 
SEQUENCE: 4 

lie Ser Tyr Gly Arg Xaa Lys Arg Arg Gin Arg Arg 

15 10 
SEQ ID NO 5 
LENGTH: 14 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: Description of Artificial Sequence : (peptide 
FEATURE: 
NAME/KEY: SITE 
LOCATION: (8) 

OTHER INFORMATION: It is acetyl - lysine . 
SEQUENCE: 5 

Ala Arg Lys Ser Thr Gly Gly Xaa Ala Pro Arg Lys Gin Leu 

1 5 10 

SEQ ID NO 6 



Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> to 
<223> fields of each sequence which presents at least one n or Xaa. 




PAGE : 6 VERIFICATION SUMMARY DATE: 03/10/2000 

PATENT APPLICATION US/09/510 , 314,-,^ *JIME: 09:51:22 



Input Set: 1510314. RAW 



Line ? Error/Warning Original Text 



216 W "N n or "Xaa" used: Feature required 

229 W n N" or "Xaa" used: Feature required 

242 W n N n or "Xaa" used: Feature required 

255 W "N" or "Xaa" used: Feature required 

1009 W "N" or "Xaa" used: Feature required 



Phe Xaa Pro Xaa Xaa Xaa Tyr Xaa Xaa Pro X 
lie Ser Tyr Gly Arg Xaa Lys Arg Arg Gin A 
Ala Arg Lys Ser Thr Gly Gly Xaa Ala Pro A 
Gin Ser Thr Ser Arg His Lys Xaa Leu Met P 
Xaa Phe Xaa Pro Xaa Xaa Xaa Tyr Xaa Xaa P 



V 



